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1. 


INTRODUCTION 


Engineers  often  obtain  data  either  to  verify  empirically  a 
postulate  of  theory  or  to  measure  experimentally  a  relevant 
characteristic  of  a  phenomenon. 

Statistics  is  a  methodology  to  interpret  data  and  to  make 
conclusions  on  a  relevant  characteristic  from  data. 

There  are  basically  two  approaches  in  engineering  analysis; 
one  is  deterministic  and  the  other  is  probabilistic.  This  is 
illustrated  by  the  following  example. 

Example  1.1.  Suppose  that  we  are  going  to  hit  a  golf  ball  with 
an  initial  velocity  vQ  and  angle  a  and  to  measure  the  horizontal 
distance  x  that  it  travels. 

(a)  Deterministic:  Newton's  gravitational  law  gives 

2 

v  sm2a 

x  =  - - -  ,  where 

g  is  gravitational  constant. 

This  result  is  applicable  only  under  many  ideal  conditions. 

(b)  Probabilistic:  Under  the  real  condition,  in  which  there 
may  be  presented  irregular  effects  of  air  resistance,  wind 
speed,  moisture  content  of  air,  temperature  change,  etc. , 
the  deterministic  modeling  may  be  impossible.  One  of  the 
probabilistic  approaches  is  to  conduct  100  identical  exper¬ 
iments  of  hitting  a  golf  ball  and  study  their  scattering  and 
distribution  of  the  distances  that  they  traveled. 


2.  BASIC  DATA  ANALYSIS 

We  introduce  in  this  section  a  basic  technique  of  data  hand¬ 
ling. 


Definition  2.1.  The  collection  of  all  possible  measurements  of 
a  characteristic  is  called  a  population.  A  collection  of  actual 
measurements  on  the  characteristic  (actual  data)  is  called  a 
sample.  The  number  of  measurements  in  the  sample  is  called  the 
sample  size. 

A  population  can  be  infinite  or  finite.  A  sample  is  always 
finite. 

We  want  to  draw  statistical  conclusions  about  the  population 
characteristics  on  the  basis  of  a  sample  of  measurements  on  that 
characteristic. 

Let  us  denote  each  measurement  in  a  sample  by  x^,  x^f  .  .., 
xn»  and  let  x ^  ,  x(2)'  *  *  * '  x(n)  t*ie  arran9einent  of  data  in 

increasing  order  of  magnitude,  of  the  sample,  i.e.  x^  is  the 
smallest  measurement  and  x^  is  the  largest  measurement  in  the 
sample. 


Definition  2.2.  The  sample  mean  denoted  by  x  is  defined  as  the 
arithmetic  mean  of  measurements; 


-  1 


=  —  (x,+x.+. 
n  '  1  2 


. +x  )  =  i 
n  n 


n 


i=l 


The  sample  median  denoted  by  x  is  defined  as  the  measurement  of 
the  middle  magnitude  in  the  sample; 


x  = 


:(¥) 


*4)  *  ■(«•■) 11  ”  ‘ 


s  even. 
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The  sample  mean  or  sample  median  describes  the  "central 
tendency"  of  the  sample  (sample  distribution) . 

2 

Definition  2.3.  The  sample  variance  denoted  by  s  is  defined 
as 


2  1  —  2 
s  =  — y  I  (x .  -x ) z  and 

n_1  i=l  1 

the  sample  standard  deviation  denoted  by  s  is  defined  as 

f  i  n  _  -Tl  1/2  _ 


=  z  (xrx)  =  [s"]1/2. 

Ln  1  i=i  1 


The  sample  standard  deviation  indicates  the  "dispersion"  of 
the  sample  distribution. 

Data  A.  Suppose  that  in  an  experiment  of  tossing  a  coin  three 
times  we  have  observed  "the  number  of  heads".  The  outcomes  of 
the  experiment  in  10  trials  are; 

xl'l'  x2=0'  *3=2 ,  x4=2,  x5=3, 

X6=0,  x7=1'  x8=1'  x9=2 '  X10=2* 

In  the  above  experiment,  our  interest  is  "the  number  of 
heads" . 

The  arrangement  of  the  data  in  increasing  order  of  magnitude 


x(l)-°, 

X(2)-0' 

x(3)=l. 

X  ( 4 )  1 ' 

% 

II 

in 

X 

CM 

II 

VO 

X 

x(7)=2' 

<N 

II 

00 

X 

X  ( 9 ) =  2 ' 

X(10)~3* 

Example  2.1.  The  computations  of  x,  x  and  s  for  Data  A. 


.  9334 


s2  =  j~j  [  (1-1.  4)  2+  (0-1.  4)  2+. .  .+  (2-1.  4)  2]  = 

s  =  [s2]1/2  =  (.  9334)  1/2  =  .9661. 

When  there  is  a  large  number  of  measurements  in  a  sample, 
the  information  of  the  sample  is  usually  summarized  as  a  "fre¬ 
quency  distribution".  This  is  done  by  grouping  the  raw  data 
into  adjoining  classes  of  measurements.  The  number  of  measure¬ 
ments  in  each  class  gives  the  "frequency"  for  that  class  of 
measurements.  This  grouping  evidently  suppresses  the  detail  of 
individual  measurements,  but  it  provides  a  comprehensive  "picture 
of  the  sample.  For  example,  it  shows  in  which  class  the  measure¬ 
ments  occur  most  often  and  over  what  range  of  classes  the  measure 
ments  occur  a  large  portion  of  the  time.  The  "relative  frequency 
is  defined  as  the  frequency  divided  by  the  total  number  of  sam¬ 
ples. 

Example  2.2.  The  frequency  distribution  for  Data  A. 


Class 

(x.) 

Frequency 

(fi> 

Relative 

Frequency  (p^) 

0 

2 

.2 

1 

3 

.3 

2 

4 

.  4 

3 

1 

.  1 

Total 

n=10 

1 

4 


Data  B.  Tensile  strength  data  for  the  composite  AS/3501-5A/10 
ply/0°  in  KSI ,  (ranked  data) . 
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23 
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6 
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24 

198 

218 

7 

226 
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25 

152 

220 

8 

204 

196 

26 

218 

220 

9 

223 

198 

27 

220 

223 

10 

225 

200 

28 

196 

224 

11 

218 

200 

29 

186 

225 

12 

244 

201 

200 

225 

13 

208 

204 

31 

214 

226 

14 

196 

204 

32 

213 

226 

15 

200 

206 

33 

234 

234 

16 

204 

207 

34 

211 

237 

17 

184 

208 

35 

217 

238 

18 

195 

209 

36 

196 

244 
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Frequency 


Example  2.3.  The  frequency  distribution  (grouped  data)  for 
Data  B. 


Strength  Class 
(KSI ) 

Class  Boundary 

Midpoint  of 
Strength  Class 
(x*) 

Number  of 

Measurements 

in  Class 

Frequency  (f.) 

1 

Relative 

Frequency 

149.5  -  159.5 

154.  5 

2 

.  0555 

159.5  -  169.5 

164.5 

0 

0 

169.5  -  179.5 

174.5 

0 

0 

179.5  -  189.5 

184.5 

2 

.  0555 

189.5  -  199.5 

194.5 

5 

.139 

199.5  -  209.5 

204.5 

9 

.250 

209.5  -  219.5 

214.  5 

6 

.167 

219.5  -  229.5 

224.5 

8 

.222 

229.5  -  239.5 

234.5 

3 

.  083 

239.5  -  249.5 

244.5 

1 

.  028 

Total 

_ 

36 

_ 

1.  000 

Figure  2.2.  Histogram  or  Frequency  Distribution  for 
Data  B. 


6 


The  first  column  of  Example  2.3  lists  the  "class  boundaries", 
defining  the  class  to  which  each  measurement  is  assigned.  These 
boundaries  are  usually  given  to  a  higher  significant  digit  than  the 
sample  measurements,  to  ensure  that  measurements  can  be  grouped 
unambiguously.  Class  boundaries  are  preferably  chosen  to  give 
equal  widths  (e.g.  10  KSI  in  Example  2.3).  The  class  width  is 
selected  to  make  tabulation  convenient,  and  to  result  in  6  to  15 
groups  when  covering  the  entire  range  of  measurements. 

Column  2  of  Example  2.3  lists  the  "midpoint"  of  each  class. 
This  value  is  needed  to  represent  the  measurements  in  the  class. 
Column  4  lists  the  "relative  frequency"  (relative  to  the  sample 
size).  Relative  frequency  tabulations  are  the  usual  form  in 
which  sample  data  are  presented  for  statistical  analysis.  A 
"histogram"  presents  the  number  of  occurrences  in  a  class  as  the 
height  of  a  rectangle  over  the  corresponding  measurement  inter¬ 
val  (see  Figure  2.2). 

The  central  tendency  and  dispersion  of  the  sample  distribu¬ 
tion  can  be  computed  directly  from  the  grouped  data  (as  noted  by 
the  symbol  *) ; 


x*  =  —  E  xtf.  and 
n  i=l  11 


s*  = 


-iy  E  (x*  — X* )  2  f. 
n-i  ,=1  i 


1/2 


1 

n-1 


2 


V  *  C 

—  E  x.  f. 

nli=l  i  1 


1/2 


where 

n  =  sample  size, 
k  =  number  of  classes. 


7 


8 


V  — 

X 

(18)  +  X  ( 19 ) 

_  209  +  211  _  „1A 

A 

(b)  From 

2  2  a.  j. 

the  grouped  data  (Example  2.3); 

X* 

1 

f  . 

x*  f . 

<x*)2 

(x*)2f . 

1 

1  1 

1 

1  1 

154.  5 

2 

309.0 

23,870.25 

47,740.50 

164.5 

0 

0 

27,060.25 

0 

174.5 

0 

0 

30,450.25 

0 

184.5 

2 

369.0 

34,040.25 

68,080.50 

194.5 

5 

972.  5 

37,830.25 

189,151.25 

204.  5 

9 

1,840.5 

41,820.25 

376,382.25 

214.5 

6 

1,287.0 

46,010.25 

276,061.50 

224.  5 

8 

1,796.0 

50,400.25 

403,202.00 

234.5 

3 

703.5 

54,990.25 

164,970.75 

244.5 

1 

244.5 

59,780.25 

59,780.25 

10 

Z  x*f . 
i-1  1  1 

7,522.  0 

10  2 
z  (xjrt.  =  1, 

585,369.00 

i-i  1  1 

X*  = 


:*  = 


1  k 
-  Z 

n  i-1 

x*f . 

1  i 

=  ■—  7,522  =  208.94, 

1  J. 

'  k 

^  ,  /  k  \2l 

1 

n-1 

E 

(x*)2f.  -  -1  E  x*f.  J 

1  1  nVi=l  1  V  J 

H- 

II 

M 

1/2 


[l, 585, 369  -  3!  (7 , 522)  2]  | 


1/2 


=  19.82, 
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X*  =  L  +  — -  W  =  199.5  +  •  10 

m 

=  209.5  . 

When  we  compare  x  and  x*f  s  and  s*  and  x  and  x* ,  there  are 
small  discrepancies  due  to  the  fact  that  the  midpoints  of  classes 
in  the  frequency  distribution  represent  the  measurements  in  the 
corresponding  classes.  These  discrepancies  are  negligible  for 
reasonably  large  sample  size. 


3. 


POPULATION  DISTRIBUTIONS 


The  preceding  tabulations,  graphs,  and  measures  refer  to  a 
sample;  they  are  merely  descriptions  of  the  data  comprising  the 
sample.  Statistical  inferences  are  conclusions  on  a  population 
characteristic,  drawn  from  information  contained  in  the  sample. 

The  manifestation  of  a  random  phenomenon  is  a  sample  of 
observations.  The  measurement  value  of  an  observation  cannot  be 
predicted,  but  the  relative  frequency  of  occurrence  of  its  value 
tends  toward  a  stable  value  in  a  long  sequence  of  observations. 
Probability  theory,  therefore,  abstracts  the  random  phenomenon 
by  dealing  only  with  the  existence  of  a  stable  frequency  pattern. 

Definition  3.1.  The  outcome  of  an  observation  on  a  random 
phenomenon  is  called  a  random  event,  and  the  totality  (population) 
of  all  possible  distinct  events,  which  are  associated  with  a 
particular  random  phenomenon,  is  called  the  sample  space. 

The  investigation  of  an  engineering  phenomenon  involves 
measurements  on  a  relevant  characteristic  of  that  phenomenon. 

Such  a  characteristic,  therefore,  can  be  represented  by  a  "meas¬ 
urement  variable" .  In  the  investigation  of  such  a  measurement 
variable,  the  occurrence  of  a  particular  measurement  x  is  an 
"event"  that  is  represented  by  a  specific  real  number.  The  var¬ 
iable  X,  which  gives  rise  to  measurement  x,  is  then  termed  a 
"random  variable".  The  measurement  value  x  is  termed  a  realiza¬ 
tion  of  the  random  variable  X. 

The  relative  frequency  of  occurrence  of  a  random  event  is 
represented  by  the  probability  of  that  event.  It  follows  that 
each  measurement  x  is  associated  with  a  probability  value, 

Pr(X=x},  which  represents  the  long-run  relative  frequency  of  the 
random  variable  X  taking  on  the  measured  value  x. 

In  the  analysis  of  random  phenomenon,  two  types  of  random 
variables  are  frequently  encountered. 
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Definition  3.2.  A  random  variable  X  is  called  discrete  if  it 
only  assumes  finite  or  countably  infinite  distinct  measurement 
values.  A  random  variable  X  is  called  continuous  if  X  can  be 
regarded  as  realizable  over  a  continuous  segment  of  the  real 
line,  i.e.  a  realization  x  may  be  any  real  number  within  some 
interval. 

Example  3.1.  Examples  of  random  variables; 

(a)  Discrete; 

X  =  the  number  of  heads  in  a  game  of  tossing 
a  fair  coin  3  times, 

Y  =  the  number  of  defects  in  a  product, 

Z  =  the  number  of  arrivals  of  cargo  vessels 
at  a  dock. 

(b)  Continuous; 

T  =  the  measurements  on  the  life  length  of 
devices, 

S  =  the  strength  of  a  composite  material, 

U  =  the  flight  range  of  a  missile. 

Example  3.2.  Examples  of  events  (continuation  of  Example  3.1) 

{X=2 }  =  the  event  that  X  is  equal  to  2, 
i.e.  the  event  that  the  number  of 
heads  is  2,  and 

{S>50}  =  the  event  that  the  strength  of 
a  composite  material  is  greater 
than  50. 
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(A)  Discrete  Random  Variables 


It  is  defined  that  a  discrete  random  variable  X  can  assume 
finite  or  countably  infinite  values,  (say,  a^,  a2»  ...»  a^)  and 
the  probability  value  Pr.{X=ai>  (i=l,  2,  ...,  k)  represents  the 
long-run  relative  frequency  of  the  random  variable  X  taking  the 
value  a^.  These  long-run  relative  frequencies  form  a  function, 
which  is  called  the  probability  mass  function  of  X. 

Definition  3.3.  p(a^)  =  Pr{X=a^}  as  a  function  of  a^,  i  =  1,  2, 

...»  k,  is  called  the  probability  mass  function  (sometimes 
abbreviated  as  p.m.f.)  of  the  random  variable  X. 

The  graph  of  p.m.f.  of  X  is; 


P(o) 


Figure  3.1.  p.m.f.  of  X 
The  properties  of  p(a^)  are, 

1.  P^)  >  0,  all  a^ 

2.  £  pU^  =  1, 


which  follow  from  the  properties  of  long-run  relative  frequency 
of  random  variable. 


Example  3.3.  The  random  variable  X  indicates  the  number  of  heads 
in  an  experiment  of  tossing  a  fair  coin  three  times. 

The  sample  space,  S,  is 

(HHH,  HHT,  HTH,  THH,  HTT,  THT,  TTH ,  TTT} 


and 


P r{HHH}  =  P  {each  outcome}  =  4 
This  random  variable  X  takes  values  from 

(0,  1,  2,  3}  and 


p(0)  =  Pr{X=0}  =  Pr(TTT}  =  | 


p  (1) 

=  Pr(X= 

1} 

=  Pr(HTT, 

THT, 

TTH} 

3 

8 

9 

P  (2 ) 

ii 

X 

^-r-> 

u 

PU 

II 

2} 

=  Pr(HHT, 

HTH, 

THH} 

3 

8 

9 

P  (3 ) 

=  Pr {X= 

3} 

=  Pr(HHH} 

1 

8 

• 

p.m.f.  of 

X  is; 

k 

0 

1 

2 

3 

P  (k) 

1/8 

3/8 

3/8 

1/8 
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One  of  the  most  important  concepts  in  statistics  is  that  of 
a  mathematical  expectation  (or  expected  value).  Let  p(a.), 
i=l,  2,  .  ..,  k  be  the  probability  mass  function  of  a  random 
variable  X. 


Definition  3.4. 


(a)  The  expected  value  (mean)  of  the  random  variable  X  is 
defined  by 

k 

y  =  E(X)  =  Z  a  •  p  (a .  )  , 
i=l 

(b)  The  variance  of  the  random  variable  of  X  is  defined  by 

a2  =  Var(X)  =  E(X-y)2  =  Z  (a  . -y )  2p  (a .  )  , 

i=l  1  1 

^  2  2 
=  Z  afp (a. )  -  y 

i=l  1  1 

(c)  The  standard  deviation  of  the  random  variable  X  is 
defined  by 

o  =  s.d. (X)  =  [Var(X)]1/2, 

(d)  In  general,  for  a  function,  g(x),  the  expectation  of 
g(X)  is  defined  by 


E[g(X)]  =  Z  g  (a.  )p  (a . ) . 
i=l 


Example  3.4.  Let  p.m. f.  of  a  random  variable  X  be 


p(0)  =  1/8,  p(l)  =  3/8,  p (2)  =  3/8  and  p(3)  =  1/8 


(See  Example  3.3),  then 
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It  is  noted  that  the  random  variable  X,  "the  number  of  heads 
in  tossing  a  fair  coin  three  times",  given  in  Example  3.3,  has 
such  a  simple  structure  that  its  probability  distribution  (popu¬ 
lation  distribution)  has  been  easily  figured  out.  It  is  not 
always  easy  to  characterize  or  find  the  probability  distribution 
for  most  engineering  measurement  random  variables.  The  distinc¬ 
tion  between  sample  relative  frequency  distribution  (mean  or 
standard  deviation)  and  population  probability  distribution 
(mean  or  standard  deviation)  can  be  clearly  observed  from  Exam¬ 
ple  2.2  (Example  2.1)  and  Example  3.3  (Example  3.4).  The  role 
of  statistics  is  to  draw  some  conclusions  on  the  population 
(random  variable  X)  from  the  information  contained  in  a  sample 
of  measurements  on  X. 

Now  we  are  going  to  consider  continuous  random  variables. 

(B)  Continuous  Random  Variables 

Definition  3.4.  The  probability  of  a  continuous  random  variable 
X  taking  a  measurement  value  in  the  interval  (x-^£,  x+^p) ,  as  a 
function  of  x,  is  written  in  terms  of  the  probability  density 
function,  (p.d.f.),  f (x)  as 
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Pr{x  "X-X-x  +  ¥}  =  f(x)dx- 


The  sample  relative  frequency,  when  the  class  interval 
reduces  toward  zero  and  the  sample  size  increased  sufficiently 
large,  approaches  a  smooth  curve,  which  will  be  p.d.f.  f(x). 

The  properties  of  p.d.f.  f (x)  are, 

1.  f(x)  >  0  for  all  real  number  x 

2 .  f°°  f  (x)  dx  =  1. 

•L  00 

Definition  3.5.  The  function  F(x)  =  Pr{X<x}  =  fx  f(u)du  is 

—  OO 

called  a  cumulative  distribution  function  (c.d.f.)  of  X. 

We  note  that  Pr(a<X<b}  =  J"a  f  (x)dx  =  F(b)  -  F(a)  for  any 
real  numbers  a  and  b  with  a<b. 

The  relations  between  p.d.f.  f  (x)  and  c.d.f.  F(x)  are 
F  (x)  =  fx  f  (u)  du  and  d-^x-^  =  f  (x)  . 

—  OO 

For  a  continuous  random  variable  X,  Pr{X=a}  =  0  for  any  real 
number  a. 

Just  like  discrete  random  variables,  the  expected  values 
for  continuous  random  variable  are  also  important  concepts. 
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Definition  3.6.  Let  a  random  variable  X  have  p.d.f.  f  (x) . 

(a)  The  expected  value  (mean)  of  the  random  variable  X  is 
defined  by 

y  =  E (X)  =  f  xf(x)dx, 

•Loo 

(b)  The  variance  of  the  random  variable  X  is  defined  by 


=  Var(X)  =  E(X-y)2  =  f*  (x-y )  (x)  dx,  and 


(c)  In  general,  for  a  function  g(x),  the  exception  of  g(X) 
is  defined  by 


Etg(X)]  =  r  g  (x) f  (x) dx. 

—  oo 

It  is  noted  that  mean  y=E (X)  describes  the  "central  tend¬ 
ency"  of  the  (population)  probability  distribution  f(x)  and 
standard  deviation  a  indicates  the  "dispersion"  of  the  probability 
distribution  f (x) . 

Example  3.5.  Let  random  variable  X  have 

p.d.f.  f ( x )  =  e_x,  for  x  >  0. 

This  is  a  probability  density  function  since 

f  (x)  =  e  x  >  0  and  f(x)dx  =  f  e  xdx  =  1. 

•L  ro  J  n 


(X)  =  r  xf(x)dx  =  f*  xe  Xdx  =  1, 
•Loo  Jo 


Var(X)  =  E  (X2 )  -  y2  =  f°°  x2f(x)dx  -  l2 

•Loo 


f°°  x2e'Xdx  -1=2-1=  1, 

J  n 
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F (x)  =  fx  f(u)du  =  fX  e  Udu  =  1  -  e  X,  x  >  0. 
■Lo  J  0 


Comparison  of  Discrete  and  Continuous 
Random  Variables 


Discrete 

- 1 

Continuous 

range 

amm 

! 

real  line  (-00,00)  i 

probability 

distribution 

p.m.f.  p^ 

p. d. f .  f (x) 

mean  y 

Z  x  .  p  . 

1*1 

/  xf (x) dx 

2 

variance  a 

r  2  2 

E  xiPi  -  y 

J  x^f (x) dx  -  y  ^ 

4. 


VARIOUS  CONTINUOUS  DISTRIBUTIONS 


The  statistical  model  of  a  random  phenomenon  is  formulated 
in  terms  of  either  p.d.f.  f(x)  or  c.d.f.  F(x). 

Random  phenomena  exhibit  a  great  variety  of  forms  of  specific 
relative  frequency  patterns.  A  statistical  model  of  such  a  pattern 
should  be  flexible  enough  to  accommodate  a  variety  of  relative 
frequency  patterns.  To  achieve  such  flexibility,  a  specific 
mathematic  function  f (x)  or  F(x)  is  generalized  by  introducing 
into  it  a  suitable  "distribution  parameter",  denoted  by  6  =  (0  , 

0£,  ...,  0m) .  The  resulting  generalized  statistical  model  is 
called  a  family  of  density  functions  or  distribution  functions. 

Thus,  a  general  statistical  model  is  f(x;0)  or  F(x,0),  where 
the  symbol  0  represents  the  parameters  which  index  that  model. 

The  central  aim  of  statistical  analysis  can  now  be  restated 
to  construct  a  statistical  model  f(x,0)  or  F(x,0)  of  the  random 
variable  X  on  the  basis  of  the  incomplete  information  contained 
in  a  sample  of  measurements  on  X. 

We  introduce  here  the  families  of  distribution  functions 

(A)  Exponential  Distributions 

(B)  Normal  Distributions 

(C)  Weibull  Distributions 

(D)  Chi-Square  Distributions, 

which  may  be  suitable  for  use  in  composite  engineering  applica¬ 
tions. 

(A)  Exponential  Distributions 

A  random  variable  X  is  said  to  have  an  exponential  distribu¬ 
tion  if  its  p.d.f.  is  of  the  form 

f  (x;  0 )  =  0e  0  X,  x  >  0.  (0  >  0), 

where  0  is  a  parameter. 
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The  c. d. f .  of  X  is , 


F (x)  =  f  f (u) du  =  f  0e~6udu  =  1  -  e~6x,  x  >  0. 

-°°  *'o 

The  expected  mean  value  and  variance  of  X  are; 

OO  CV> 

y  =  E  (X)  =  J  xf(x)dx  =  J*  x0e  0xdx  =  -g-  , 

-oo  0 

E (X2 )  =  f  x20e~Oxdx  =  ,  and 

o  ez 

°2  -  Var<x>  -  e(x2)  -  “2  -  fs  -  (?)2  '  ^  • 

Most  applications  of  the  exponential  distribution  are  based 
on  its  "memoryless  property",  when  the  measurement  random  varia¬ 
ble  X  has  a  time  dimension.  This  property  refers  to  the  feature 
of  a  phenomenon  in  which  the  history  of  past  events  does  not 
influence  the  probability  of  occurrence  of  present  or  future 
events. 

(B)  Normal  Distributions 

The  normal  family  of  probability  density  functions  is 
defined  as 

f  (x;y,a)  =  — - —  exp  -  ^ 
o/2tt  l 

-00  <  x  <  °°,  -00  <  y  <  oof  a  >  0 

where  y  is  a  location  parameter  and  a  is  a  scale  parameter. 

We  can  denote  this  distribution  by  X  ~  N(y,o)  if  a  random 
variable  has  the  above  p.d.f.. 

j 

Theorem  4.1.  If  X  ~  N(y,o),  then  E(X)  =  y  and  s.d. (X)  =  a. 

This  theorem  implies  that  the  parameters  y  and  a  are  pre¬ 
cisely  the  mean  and  standard  deviation  of  the  random  variable  X 
respectively. 
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If  a  random  variable  Z  has  the  normal  distribution  with 
U=0  and  o=l,  i.e.  Z  ~  N(u, i),  then  Z  is  said  to  have  the  stand¬ 
ard  normal  distribution.  The  p.d.f.  of  Z  is 

f(z;0,l)  =  — exp  (-■%-)  ,  -oo  <  z  <  oo 

/2?  '  2  / 


Figure  4.1.  Normal  Density  Function 


Figure  4.2.  Standard  Normal  Density  Function 

Both  pictures  of  the  probability  densities  of  X  and  Z  are 
identical  except  for  the  scales  on  the  horizontal  lines.  They 
are  bell-shaped  curves  and  symmetric  about  the  means.  The  fol¬ 
lowing  theorem  is  useful  in  computing  the  probabilities  of  ran- 

j 

dom  variables  X  ~  N(y,o). 

Theorem  4.2.  If  X  ~  N(y,cr),  then 

z  =  d  N(0/1)  . 
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of  the  standard  normal  distribu- 


The  values  for  the  c.d.f. 
tion  is  given  in  Table  I. 


TABLE  I.  NORMAL  DISTRIBUTION 


F(x) 


dt 


!- 

.00 

.01 

.02 

.03 

.04 

.05 

.06 

.07 

.08 

.09  j 

.0 

.5000 

.5040 

.5080 

— 

.5120 

.5160 

.5199 

.5239 

.5279 

.5319 

.5359 

.1 

.5398 

.5438 

.5478 

.5517 

.5557 

.5596 

.56  36 

.5675 

.5714 

.5753 

.2 

.  5793 

.5832 

.5871 

.5910 

.5948 

.5987 

.6026 

.6064 

.6103 

.6141  1 

•3 

.6179 

.6217 

.6255 

.6293 

.6331 

.6  368 

.6406 

.6443 

.6480 

.6517  | 

.4 

.6554 

.6591 

.6628 

.6664 

.6700 

.6736 

.6772 

.6808 

.6844 

.6879  | 

.5 

.6915 

.6950 

.6985 

.7019 

.7054 

.7088 

.7123 

.7157 

.7190 

.7224 

.6 

.7257 

.7291 

.7324 

.7357 

.7389 

.7422 

.7454 

.7486 

.7517 

.7549 

.7 

.7580 

.7611 

.7642 

.7673 

.7704 

.7734 

.7764 

.7794 

.7823 

.7852 

.3 

.7881 

.7910 

.7939 

.7967 

.7995 

.8023 

.8051 

.8078 

.8106 

.8133  | 

.9 

.3159 

.8186 

.8212 

.8238 

.8264 

.8289 

.8315 

.8340 

.8365 

.8389 

1.0 

.3413 

.34  38 

.8461 

.8485 

.8508 

.8531 

.8554 

.8577 

.8599 

.8621 

i  1.1 

.8643 

.8665 

.8686 

.8708 

.8729 

.8749 

.8770 

.8790 

.8810 

.8830 

1.2 

.8849 

.8869 

.8888 

.8907 

.3925 

.8944 

.8962 

.8980 

.3997 

.9015 

1.3 

.9032 

.9049 

.9066 

.9082 

.9099 

.9115 

.9131 

.9147 

.9162 

.9177 

1.4 

.9192 

.9207 

.9222 

.9236 

.9251 

.9265 

.9279 

.9292 

.9306 

.9319 

1.5 

.9332 

.9345 

.9357 

.9370 

.9382 

.9394 

.9406 

.9418 

.9429 

.9441 

1.6 

.9452 

.946  3 

.9474 

.9484 

.9495 

.9505 

.9515 

.9525 

.9535 

.9545  ' 

i  i-7 

.9554 

.9564 

.9573 

.9582 

.9591 

.9599 

.9608 

.9616 

-9625 

.9633 

1.8 

.9641 

.9649 

.9656 

.9664 

.9671 

.9678 

.9686 

.9693 

.9699 

.9706 

1.9 

.9713 

.9719 

.9726 

.9732 

.9738 

.9744 

.9750 

.9756 

.9761 

.9767 

2.0 

.9772 

.9778 

.9783 

.9788 

.9793 

.9798 

.9803 

.9808 

.9812 

.9817 

2.1 

.9821 

.9826 

.9830 

.9834 

.9838 

.9842 

.9846 

.9850 

.9854 

.9857 

j  2.2 

.9861 

.9864 

.9868 

.9871 

.9875 

.9878 

.9881 

.9884 

.9887 

.9890 

2.3 

.9893 

.9896 

.9898 

.9901 

.9904 

.9906 

.9909 

.9911 

.9913 

.9916 

2.4 

.9918 

.9920 

.9922 

.9925 

.9927 

.9929 

.9931 

.9932 

.9934 

.9936 

2.5 

.9938 

.9940 

.9941 

.9943 

.9945 

.9946 

.9948 

.9949 

.9951 

.9952 

2.6 

.9953 

.9955 

.9956 

.9957 

.9959 

.9960 

.9961 

.9962 

.996  3 

.9964 

2.7 

.9965 

.9966 

.9967 

.9968 

.9969 

.9970 

.9971 

.9972 

.9973 

.9974 

2.8 

.9974 

.9975 

.9976 

.9977 

.9977 

.9978 

.9979 

.9979 

.9980 

.9981 

2.9 

.9981 

.9982 

.9982 

.9983 

.9984 

.9984 

.9985 

.9985 

.9986 

.9986 

3.0 

.9987 

.9987 

.9987 

.9988 

.9988 

.9989 

.9989 

.9989 

.9990 

.9990 

3.1 

.9990 

.9991 

.9991 

.9991 

.9992 

.9992 

.9992 

.9992 

.9993 

.9993 

3.2 

.9993 

.9993 

.9994 

.9994 

.9994 

.9994 

.9994 

.9995 

.9995 

.9995 

3.3 

.9995 

.9995 

.9995 

.9996 

.9996 

.9996 

.9996 

.9996 

.9996 

.9997 

3.4 

.9997 

.9997 

.9997 

.9997 

.9997 

.9997 

.9997 

.9997 

.9997 

.9998 

X 

1.282 

1.645  1.960 

2.326  2.576  3.090  3.291 

3.391 

- 

4.417 

F(x) 

.90 

•95 

.975 

.99  1  .995  .999 1  .9995 

.99995 

.999995 

2(1  -  F (x) J 

.20 

.10 

.05 

.02  .01  .002  .001 

.0001 

.00001 

from  "Introduction  to  the  Theory  of  Statistics"  by  A.M.  Mood,  McGraw-Hill, 
New  York,  1950. 


Example  4.1.  Let  Z  ~  N(0,1),  then 


(a)  Pr{0<Z<1.5}  =  Pr(Z<1.5}  -  Pj. {Z<0 } 

=  F (1. 5)  -  F(0),  where  F(z)  is  c.d.f.  of  Z 
=  .9332  -  .5  (from  Table  I) 

=  .4332 


(b)  Pr{-l<Z<1.5}  =  .4332  +  .3413  =  .7745 


Definition  4.1.  A  symbol  z^  is  defined  to  designate  a  number 
satisfying  Pr{Z>zv}  =  v  (0<v<l), 


(see  the  following  example) . 


Example  4.2. 

(from  Table  I) 

V 

z 

V 

.  05 

1.  645 

.  025 

1.  960 

.  01 

2.  326 

Example  4.3. 

If  X  -  N (70,5) ,  then 

P  {X>60 }  =  P  | 
r  -  r  1 

X-70  „  60-70) 

:  5  -  5  f 

=  P  { Z  >-2 }  =  P 

r  -  r 

,{Z<2}  =  .9772 

The  normal  distribution  is  undoubtedly  the  best-known  sta¬ 
tistical  measurement  model.  This  distribution  has  wide  applica¬ 
bility  because  of  the  central  limit  theorem,  which  will  be  pre¬ 
sented  in  a  later  section. 


(C)  Weibull  Distributions 


The  family  of  Weibull  probability  density  functions  (two 
parameter  family)  is  defined  as 


f (x; 6, a) 


t 


x,8,a  >  0, 


where  8  is  a  scale  parameter  and  a  is  a  shape  parameter.  The 
Weibull  c.d.f.  is  found  directly  as 


F (x; 8, a) 


x 

f  f(x';8,a)dx' 

•'o 


t 


x  >  0. 


If  a  random  variable  X  has  the  above  Weibull  distribution,  then 
we  will  write  X  ~  W(8,ot). 
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Pr{*  i 


The  Weibull  model  can  be  used  to  represent  a  wide  variety  of 
engineering  random  phenomena  and  also  the  Weibull  distribution 
is  used  to  model  the  breaking  strength  and  fatigue  life  of  metals 
and  composite  materials. 


(D)  Chi-Square  Distributions 

A  random  variable  X  has  a  Chi-square  distribution  with  n 

d  2 

degree  of  freedoms  (we  write  X  ~  X  (n) )  if  its  p.d.f.  is 


Figure  4.4.  Chi-Square  Density  Function 
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Theorem  4.4.  If  X  ~  x^(n)»  then 

E (X )  =  u  =  n  and  Var(X)  =  =  2n. 

.  .  .  2  2 
Definition  4.2.  Symbols  xy(n)  and  x^_^(n)  are  defined  to  desig¬ 
nate  the  numbers  satisfying, 

Vx>*v(n>>  =  v  and  pr(>->  xi-v  =  1-v» 
where  X  ~  x  2 (n) . 


C(x) 


X“  ~  density  "unction 


(See  the  following  example.) 


The  Chi-square  distribution  is  extensively  used  for  infer- 

2 

ences  on  the  normal  variance  o  ,  and  in  certain  distribution 
tests. 

The  values  for  the  Chi-square  distribution  are  given  in 
Table  II. 
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TABLE  II.  CHI-SQUARE  DISTRIBUTION 
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Example  4.6.  (Hahn-Kim  and  Yang's  residual  strength  degradation 
models  for  the  composite  material  fatigue  failures). 

Let  S  =  applied  load  at  each  fatigue  cycle, 

R(n)  =  residual  strength  at  nth  fatigue  cycle  (random 
variables) 

R ( 0)  =  the  static  strength  (random  variable) 

N  =  the  fatigue  life  strength  in  cycle  (random 
variable) . 


Assumptions; 

(a)  R ( 0)  ~  W(6,a) 

(a  and  6  can  be  estimated  from  the  static  strength 
data) , 

(b)  The  degradation  rule; 

RC(n)  =  R ( 0) C  -  £3CKSbn 

(c,  K  and  b  can  be  estimated  from  the  residual  strength 
data) . 

The  c.d.f.  of  the  residual  strength  R(n)  is; 

FR(n)(x)  =  Pr(R(n)<x}  =  Pr(Rc(n)<xc} 

=  Pr {Rc (0) -3cKSbn<xc}  (from  the  assumption  (b) ) 

=  Pr { Rc ( 0 ) <xc+  BcKSbn } 

=  Pr (R(0) < (xc+BcKSbn) 1/Cl 


Note  that  the  fatigue  life  strength  N  has  a  three  parameter 
Weibull  distribution. 
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5.  JOINT  DISTRIBUTIONS  (CONTINUOUS  RANDOM  VARIABLE) 

The  statistical  model  F(x,y;0),  which  describes  the  joint 
behavior  of  two  random  variables  X  and  Y,  is  called  joint  cumu¬ 
lative  distribution  function  or  joint  c.d.f. .  Its  probability 
interpretation  is 


F(x,y;0)  =  Pr{X<x  and  Y<y|0}. 


is 


For  two  continuous  random  variables  X  and  Y,  the  joint  p.d.f. 


f(x,y*0)  =  9  fo '  Y '  9 1 
'x'y'  '  3x3y 


so  that 


F  ( 


rx  rY 

x,y;0)  =  /  /  f(u,v; 

—  00  oo 


0 ) dudv. 


The  joint  p.d.f.  f(x,y;0)  satisfies 
(a)  f(x,y;0)  >  0  for  any  real  x  and  y 


(b) 


f (x, y ; 0 ) dxdy 


1. 


From  a  joint  p.d.f.  f(x,y)  of  random  variable  X  and  Y,  one 
can  obtain  p.d.f.  of  X  (or  Y)  alone  (call  marginal  p.d.f.  of  X 
(or  Y) )  by  integrating  out  as  follows; 


fx(x) 


f (x,y)dy 


f 


2 


f (x,y) dx. 


Example  5.1.  Let  the  joint  p.d.f.  of  two  exponential  variables  be 


f(x,y;01,02)  =  0  ^0  2  exp{-01x-02y  }  • 


The  marginal  p.d.f.  of  X  is  obtained  by  integrating  out  variable 

y; 


f  1  (x;  Gj^) 


f (x,y;01,02)dy  =  Gj 


exp {-0j x } . 


Definition  5.1.  The  conditional  probability  density  function  of 
X  given  Y=y  is  defined  as 


f(x|y)  = 


f9(y) 


where  f(x,y)  is  the  joint  p.d.f.  of  X  and  Y  and  f2(y)  is  the 
marginal  p.d.f.  of  Y,  provided  that  f(y)>0. 

The  concept  of  conditional  p.d.f.  is  introduced  to  predict 
the  probabilistic  behavior  of  the  first  random  variable  in  terms 
of  a  specific  value  of  the  second  random  variable. 


Example  5.2.  Let  the  joint  p.d.f.  of  (XfY)  be 


4x2*  dy 


0<y<l . 


0<x<l 


f (x|y) 


f (x,y)  _  4x2 

f2(*>  "  i(l-y3) 


0<y<x<l, 


f (y|x) 


f (x,y) 
(x) 


0<y<x<l. 
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The  conditional  p.d.f.  f(x|y)  contains  information  on  the 
dependence  of  X  on  Y.  The  covariance  is  a  measure  of  the  disper¬ 
sion  of  both  X  and  Y,  just  as  the  variance  measures  the  dispersion 
of  a  single  random  variable. 

Definition  5.2.  Let  random  variable  (X,Y)  have  joint  p.d.f. 
f(x,y).  The  covariance  of  X  and  Y,  denoted  by  cov(X,Y),  is 
defined  by, 


COV (X, Y)  =  E [  (X-EX)  (Y-EY) ]  =  E  (XY)  -  (EX)(EY) 

(x-ux) (y-Uy)  f (x,y)dxdy  . 

The  correlation  coefficient  of  X  and  Y,  denoted  by  p (X,Y) , 
is  defined  by 


P  (X ,  Y )  = 


cov (X, Y) 


s.d.  (X)  s.d.  (Y) 


Example  5.3.  f(x,y)  =  4x  ,  0<x<l,  o<y<x. 

.  3  ■,  r1  .  4.  4  5 1 1 

x*4x  dx  =  I  4x  dx  =  Tx 

0  5 


yx  =  E(x) 


A  3  r1 
=  I  x*  4x  dx  =  j 

J  0  J  C 


4 

5 


=  E(Y)  =  f  y|(l-y3)dy  =  ( y-y 4 

=  i(y!  _  y!\  1  =  i  .  (I  _  l)  =  1 

3  V  2  5  /  n  3  \  2  5  /  5 


)dy 


E  (XY) 


-P°  p™  L  X 

=  11  xyf(x,y)dxdy  =  f  f  x-y4x^dy 

■—  oo  —  oo  J  0  J  0 

■/. 


dx 


4x3  • 


dx  =  f  2x^dx  =  jr 
0  ^0  6 


1 

3 
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E(X2) 


■  /  *  h 


(x )  dx 


x24x3,dx  =  f  4x5dx  ~  \ 

n  J  n  6  i 


Var(X)  =  E  (X2 )  -  (EX)2  =  |  -  (|)  = 

Var(Y)  =  E(Y2)  -  (EY)2  =  |  -  (|)2  =  ^ 

COV (X, Y)  =  E (XY )  -  (EX) (EY)  =  j  -  |  •  |  =  . 

Suppose  that  simultaneous  measurements  on  two  random  varia¬ 
bles  are  taken.  These  random  variables  are  independent  (intui¬ 
tively)  if  any  information  on  one  random  variable  provides  no 
information  on  the  other  random  variable.  In  this  case,  the 
conditional  p.d.f.  of  one  random  variable,  given  any  value  of  an 
independent  random  variable,  equals  the  marginal  p.d.f.  of  the 
first  random  variable.  That  is, 

f (x|y;0)  =  f (x;0) . 

But,  from  the  definition  of  the  conditional  probability  (Defi¬ 
nition  5.1), 

f  (x | y ; 0 )  =  f (x,y; 0) /f  (y; 0) , 
thus  the  independence  of  X  and  Y  means  that 

f  (x,y;0)  =  f (x;0)  f (y;0) , 
for  any  real  x  and  y. 
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Definition  5.3.  Two  random  variables  X  and  Y  are  said  to  be 
"independent"  if  their  joint  p.d.f.  is  equal  to  the  product  of 
the  p.d.f.'s  of  X  and  Y,  that  is, 

f (x,y ; 0 )  =  f  (x; 0 )  •  f(y;0). 

Which  of  the  following  pairs  of  measurements  (events)  are 
independent? 

1.  number  of  years  of  education  vs.  amount  of  yearly 
income 

2.  having  green  eyes  vs.  being  a  banker 

3.  breaking  strength  of  composite  vs.  its  fatigue  life 

4.  number  appeared  in  1st  toss  of  a  die  vs.  number 
appeared  in  2nd  toss  of  a  die. 

(Answer;  (2)  and  (4)). 

Example  5.4.  Let  random  variables  X  and  Y  be 

X  =  number  appeared  in  1st  toss  of  a  die 
Y  =  number  appeared  in  2nd  toss  of  a  die 

Pr{X=3,  Y-4>  =  jj 

Pr{X-3)  -  £,  PrtY-4)  -  i 

Pr{X=3,  Y=4)  =  =  Pr  {X=3}-Pr  {Y=4}. 

Also  Pr(X=i,  Y=  j  }  =  Pr  {X=i}-Pr  {Y=j} 

for  i  and  j  =  1,2,.. .,6. 

Therefore,  X  and  Y  are  independent. 
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We  note  that  if  X  and  Y  are  independent  with  p.d.f.  fv(x) 

A 

and  fy(y),  then  their  joint  p.d.f.  f(x,y)  can  be  obtained  by 

f  (x,y)  =  fx (x) • fy  (y) . 

Example  5.5.  We  introduce  here  a  concept  of  failure  rate,  which 
is  one  of  the  basic  concepts  in  reliability,  in  terms  of  defi¬ 
nition  of  the  conditional  probability. 

Let  T  be  a  failure  time,  random  variable,  of  a  device  with 
c.d.f.  F { t)  and  p.d.f.  f  (t) . 

The  failure  rate  h(t)  at  time  t  is; 

h(t)dt  =  Pr  {instantaneous  failure  of  the  device  during 

the  time  interval  (t,t+dt) |it  survived  up  to  t} 

f  (t)dt 
l-F(t) 
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6.  SAMPLING  AND  ESTIMATIONS 

In  the  preceding  discussions  a  random  variable  X  represents 
a  measurable  characteristic  of  some  underlying  phenomenon  and  its 
p.d.f.  f (x; 0 )  is  the  probability  distribution  of  X.  Realizations 
of  that  random  variable  X  gives  rise  to  quantitative  information 
on  that  characteristic. 

Suppose  that  we  have  taken,  from  an  experiment,  a  sample  of 

n  measurements  x..  ,  x^,  .  ..,  x  on  a  random  variable  X  with  the 
1  2  n 

p.d.f.  f(x;@).  These  measurements  x^,  X2,  .  ..,  x^  are  considered 

as  a  realization  of  random  variables  X, ,  X_,  ...,  X  (random 

12  n 

sample) ,  which  we  are  going  to  define. 

Definition  6.1.  A  collection  of  random  variable  X, ,  X_,  ...,  X 
-  l  z  n 

is  called  a  random  sample  (of  size  n)  from  a  random  variable  X 
(or  from  a  p.d.f.  f(x;0))  if 

(a)  each  random  variable  X^  has  p.d.f.  f(x;0) 

(b)  X^ ,  X2,  ...,  Xn  are  independent  random  variables. 

We  note  that  each  sample  X^  is  a  random  variable  until  its 
measurement  x^  is  taken  and  its  p.d.f.  is  ffx^;©)  and  the  word 
"random"  in  the  random  sample  means  each  sample  value  X^  is  not 
influenced  in  any  way  by  any  other  sample  value  X^  (j^i). 

It  follows  clearly  that  the  joint  p.d.f.  of  a  random  sample 

x]/  X2 '  *  *  *  '  Xn  is 

f (x. ,x_, . . . ,x  :0)  =  f(x.;0)  f(xo;0)  —  f(x  ;0) 

1  z  n  1  z  n 

n 

=  7T  f  (x.  ;0)  . 
i=l  1 

n 

If  we  let  L  (0  =  7T  f(x.;0)  and  consider  L(0)  as  a  function  of 

i=l  1 

the  parameter  0,  then  it  is  called  the  likelihood  function  of  a 

sample  X. ,  X_,  . . . ,  X  . 

1  z  n 

Suppose  that  random  measurement  characteristic  X  has  a  p.d.f. 
f(x;0)  and  a  family  of  distributions  that  f(x;0)  belongs  to  is  a 
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known  family  (say  Normal  or  Weibull  family).  Only  unknown  cri¬ 
teria  about  f(x;0)  is  the  parameter  0,  whose  value  is  required 
to  be  estimated  in  a  statistical  inference  problem. 

,  A  A 

A  function  0n  =  0  (X^ ,  X^,  .  ..,  Xn)  of  a  random  sample  X^, 

X 2*  .  ..,  X^,  which  approximates  the  unknown  value  of  the  param¬ 
eter  0  in  a  certain  sense,  is  called  an  "estimator"  of  9.  As  a 
function  of  random  variables,  6^  is  a  random  variable  and  hence 

it  has  a  distribution  function.  A  realization  $  =  9  (x. ,  x_,  ...» 

A  A  n  1  2 

x  )  of  9  =  9 (X. ,  Xof  ...»  X  )  is  called  an  "estimate"  of  6  (an 

n  n  1  2  n 

estimate  of  9  is  a  number) . 

The  following  criterias  can  be  applied  to  characterize 
good  estimators; 

1.  Unbiased  if  E  n ]  =  9  for  each  n, 

2.  Consistent  if  P_{ |9  -9  I  >  e}  <  e 

for  any  real  number  e  and  sufficiently  large  n, 

3.  Minimum  variance  if  E(^n-0)^  <  E(0*-0)^  for  any 
other  estimator  0*  of  9  with  E(9*)  =  9, 

n 

4.  Maximum  likelihood  (M.L.E.)  if  L(9)  =  rr  f(x.;0) 

achieves  maximum  at  9  =  &  . 

n 

The  usual  procedure  to  obtain  M.L.E.  of  9  is  to  take  the 

-  j 

derivative  L(9)  and  solve  for  gg-  L(9)  =  0. 

For  various  families  of  distributions,  the  sample  mean,  X, 
and  sample  standard  deviation,  S,  are  good  estimates  for  the 
population  mean  p  and  population  standard  deviation  a  respectively. 

Example  6.1.  Let  a  random  variable  X  have  the  exponential  dis¬ 
tribution  with  parameter  9. 

f  (x; 0 )  =  0e  0X,  x  >  0. 

The  likelihood  function  L(0)  of  X, ,  X_,  ...»  X  is 

12  n 
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L(0) 


tt  f  (x.  ;0) 
i=l  1 


o  -0x. 
it  9e  1 


„  -e . e.x. 
0ne  1=1  \ 


dL(0)  _  _nn-l  _-0Ex;  „nr  -0Exj 
— g-Q —  -  n0  e  i  -  0  Ix.e  i 


=  9"-1  e-6Exi  (n-eixp  -  0 


if  and  only  if  n  -0  E  x.  =  0. 

i=l  1 


i.e.  0  = 


E  X. 
i=l  1 


Therefore,  M.L.E.  of  0  is 


A  _  n  1  ,  — 

0_  =  — -  =  —  ,  where  X 

n  n  -  n 

E  X.  n 

i=l  1 


Example  6.2.  Let  X  ~  N(y,o),  where  the  value  of  a  is  known-. 


n  n  i  ,/x.-yy 

=  tt  f(x.;y)  =  tt  — ±—  exp  -  i M— 

i=l  1  i=l  a/2?  a  ' 


-  (-i-)"  exp  -  §  ;  (^)2 . 

\a/2?/  2  i=l'  a  > 


dL(y) 

dp 


—  I-.- 


Hence  d'dyU)  =  0  if  and  only  if  E  (x..-y)  =  0  if  and  only  if 
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U  =  —  Z  x.  =  x  .  Therefore,  y  =  ^  Z  X.  =  X  is  M.L.E.  of  y 
n  in  n  n  1  n 

It  can  be  easily  shown  that  £n  is  also  unbiased,  consistent  and 

minimum  variance  estimator  of  y.  The  fact  that  the  distribution 

of  the  estimator  j  =  X  is  N(y,o/*^n)  follows  from  the  following 

n  n 

theorem. 

Theorem  6.1.  If  X  ~  N(y,o),  then  Xn  ~  N(y,a//n). 

In  fact,  when  sample  size  n  is  sufficiently  large  (n  >  30), 
the  sample  mean  Xn  has  always  an  approximately  normal  distribu¬ 
tion  regardless  of  the  shape  of  the  population  distribution. 


Theorem  6.2.  (Central  limit  theorem)  Let  a  random  variable  X  have 
a  finite  mean  u  and  standard  deviation  a  .  Then  the  sample 
mean  Xn  has  an  approximately  normal  distribution  with  the  mean  jj 
and  the  standard  deviation  cr/Vn-  for  sufficiently  large  n  regard¬ 
less  of  the  p.d.f.  of  X. 

•The  following  Figure  6.1  illustrates  the  fact  that  the  dis¬ 
tribution  of  X^  approaches  the  normal  probability  density  func¬ 
tion  as  n  increases  when  the  population  distribution  are  either 
an  exponential  or  uniform  distribution. 

The  central  limit  theorem  is  very  useful  in  statistical 
inferences  because  the  unknown  population  mean  y  is  usually  esti¬ 
mated  by  the  sample  mean  X^  and  the  distribution  of  is  approx¬ 
imated  by  a  normal  distribution  (for  sufficiently  n)  according  to 
the  central  limit  theorem. 

Linear  Regression  -  Data  Analysis 

For  two  random  variables  X  and  Y,  a  linear  relationship 
Y  =  aX  +  b,  where  a  and  b  are  some  real  constants,  can  be  inves¬ 
tigated  by  linear  regression  analysis. 
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Distributions  of  Xn» 


Suppose  that  we  have  a  random  sample  of  size  n  (x^,y^), 

(x2'y2*'  •**'  *Xn'yn}  taken  from  the  Pair  (X,Y).  It  is  desired 
to  obtain  a  straight  line  y  =  ax+b  which  fits  the  data  best.  In 
other  words,  the  constants  a  and  b  in  y  =  ax+b  should  be  esti- 


The  applications  of  this  linear  regression  technique  in 
composite  material  engineering  are 


(1)  Estimation  on  Weibull  parameters  (presented  in 
next  section) 

(2)  S-N  curve  fitting  (see  Example  6.4.) 

The  (population)  correlation  coefficient  p(X,Y)  between  two 
random  variables  X  and  Y  has  been  defined  in  Section  5.  Based 
on  the  data  (x^y^),  (x2,y2),  ...,  (xn»Yn)  the  sample  correlation 

coefficient  r  is  computed  as 


nZx. 


r  = 


i*i  -  EVE*i 


/nZX?  -  (Ev.  )2  /nZy?  -  (Zy,  )2 


which  is  an  estimate  of  p(X,Y). 


Example  6.3.  Let  X  and  Y  denote  one's  age  and  blood  pressure 
respectively.  Age  and  blood  pressure  data  for  five  persons  is 


X 

15 

27 

20 

45 

40 

Y 

110 

115 

113 

132 

135 

2 

x _ y _ x _ xy 


15 

110 

225 

1650 

27 

115 

729 

3105 

20 

113 

400 

2265 

45 

132 

2025 

5940 

40 

135 

1600 

5400 

147 

605 

4975 

18355 

h 

ii 

n 

ii 

E*i 

Eyi 

Zx2 

i 

ElVi 

nEXiyi  ~  Exi  •  Ey  i  _  5x18355  -  147x605 
nix2  -  (ZXi)2  5x4975  -  (147) 2 


2840 

3286 


.86 
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E  -Za  xi 


605  -  .86  X  147 


=  95.72 


.'.  y  =  ax  +  b  =  .86x  +  95.72 

Example  6.4.  The  fatigue  life  N  of  a  composite  material  is 
observed  under  applied  stress  levels  S  =  45,  55  and  65  KSI  and 
the  data  obtained  is  as  follows. 

Data  C. 


Since  our  form  of  fatigue  model  is  N  =  BS  ,  by  taking  the  LOG  of 
both  sides,  we  obtain 

LOG  N  =  a  LOG  S  +  b, 

where  b  =  LOG  B. 


n=10 


Total 


x  =  LOG  S 

y  =  LOG  N 

x2 

1.65 

7.05 

2.7225 

1.  65 

7.18 

2.7225 

1.74 

4.85 

3. 0276 

1.74 

5.  05 

3.0276 

1.  74 

5.26 

3.0276 

1.  74 

5.53 

3.0276 

1.  81 

3.42 

3.2761 

1.81 

3.82 

3.2761 

1.81 

3.87 

3.2761 

1.81 

3.98 

3.2761 

17.50 

50.01 

30.6598 

11. 6325 
11.8470 
8.4390 
8.7870 
9.1524 
9. 6222 
6.1902 
6.9142 
7.0047 
7.2038 

86.7930 
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:  10x86.793  -  17.5x50.01 
10x30.66  -  (17. 5)2 

-20.7 

_  50.01  -  (-20.7 ) x!7 . 5 

To  : 

LOG  N  =  -20.7  LOG  S  +  41.23. 


-7.245 
0.  35 
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7.  ESTIMATIONS  ON  THE  WE I BULL  DISTRIBUTION 

For  static  strength  tests  or  fatigue  life  tests  of  various 
composite  materials,  the  two-parameter  Weibull  distribution 
W(S,a)  is  widely  used  as  their  failure  distributions. 

We  note  again  that  if  X  -  W(3,a),  then  its  p.d.f.  is 
expressed  by 


f  (x;  B , a)  -  |  (f)°  1  exp  -  [§]“,  x  >  0. 


(A)  M.L.E.  (Maximum  Likelihood  Estimate) 

^2 '  *  '  ‘  ke  3  random  sample  of  size  n  from  the 

W(6,a)  Weibull  distribution.  The  likelihood  function  L(3,a)  is 

n 

L  ( 8 ,  a )  =  7T  f  (x.  ;6,a) 


fa  \n  n  /xi\“"1  r  2  1 

■  (i)  W  expL-  J- 


The  M.L.E. ' s  8  and  a  of  3  and  a  should  be  the  solutions  of 
the  equations 

3L(Brtx)  _  A  9L(B,cx) 

98 - 0  and  — 9^ -  =  L*e- 


■,  n  1 1/a 
8  =  -  I  xa 

n  i=l  1 


Z  x .  £nx . 
i=l  1  1  i  n 

-  +  ±  r  Or,v 

Z  X?  "  A  1 

i=l  1 
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The  M.L.E.,  a,  of  a  can  be  obtained  by  an  iterative  proced¬ 
ure.  If  the  value  of  a  is  known  (or  estimated),  then  the  M.L.E., 
$n,  of  B  is 


A 

^n  " 


.  n 

-  Z  X? 
n  i-1  1 


1/a 


The  probability  distribution  of  8  is 


n 


2n 


n 


d  2  ,  _  . 
~  X  (2n) 


(B)  Graphical  Plotting  (Linear  Regression) 

Let  a  breaking  strength  (or  fatigue  life)  characteristic  X 
have  a  Weibull  distribution  with  the  parameters  B  and  a.  Its 
c. d. f .  is 


F(x) 


1  -  exp 


x  >  0. 


The  linear  regression  method  can  be  applied  to  estimate  the 
parameters  B  and  a. 

Taking  the  £n£n  transform  of  F(x),  we  have 
£n£n  Y-F{x)  =  aiinx  “  a£n8. 

Suppose  that  we  have  tested  n  specimens  and  obtained  data  x^,  X£, 
xn  from  the  W(B,a)  distribution. 

Let 

y  =  £n£n  — Jr- T  / 
l-F(x) 

z  =  £nx  , 
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b  =  -a£nB 


Pi  -  1+n  /  i~l»  2, . . .  ,n, 

y i  —  H n £ n  ^_p  >  i=l»2»...,n» 
pi 

z^  =  In  x(jj»  i=l,2,...,n, 

where  x^,  x(2)'  ***'  x(n)  are  t*le  or<^er  statistics  (the 
arrangement  of  x^,  x^t  .  ..,  xn  in  increasing  order)  of  x^,  x^, 

•  •  •  /  X  • 

n 

Based  on  the  observations  (z,,y,),  (z^y-),  ...»  (z  ,y  )  , 
a  linear  regression  line  y  =  az+b  can  be  estimated  and  a  and  8  = 
exp  - [b/a]  are  the  estimates  for  a  and  B. 

Data  D.  Tensile  strength  of  AS/3501-5A/16  ply/0°  in  KSI  (n=8) 

152  196  200  207 

214  220  226  244  * 

Example  7.1.  Assume  that  Data  D  came  from  a  Weibull  distribution 
W(8,ct) . 


(a)  Estimation  of  B  and  a  by  Graphical  Plotting; 


rank 

(i) 

X(i) 

Pi 

1 

zi 

=ln  xa) 

^i 

2 
z . 

l 

z .  y . 
i 

i-Pi 

x»  n  X/  xi  ^ 

!-Pi 

1 

152 

1/9 

9/8 

5.02 

-2. 139 

25.20 

-10.783 

2 

196 

2/9 

9/7 

5.28 

-1.381 

27.88 

-  7.292 

3 

200 

3/9 

9/6 

5.30 

-1. 903 

28.09 

-  4.786 

4 

207 

4/9 

9/5 

5.33 

-  .531 

28.41 

-  2.830 

5 

214 

5/9 

9/4 

5.37 

-  .210 

28.84 

-  1.128 

6 

220 

6/9 

9/3 

5.39 

.  094 

29.05 

.  507 

7 

226 

7/9 

9/2 

5.  42 

.408 

29.38 

2.211 

8 

244 

8/9 

9/1 

5.50 

.  787 

30.25 

4.329 

Total 

42.61 

-3.875 

227.10 

-19.727 

6.14 


50 


8.  CONFIDENCE  INTERVALS 
(A)  For  Normal  Distributions 

We  start  this  section  by  giving  an  example  to  motivate  and 
understand  the  concept  of  confidence  intervals. 

Example  8.1.  Suppose  that  a  measurement  characteristic  X  has 
N(u,o)  distribution,  where  p  is  unknown  and  a  is  a  known  value. 

On  the  basis  of  a  random  sample  X^  X2,  ...»  Xn  from  X,  we  know 

1  n 

that  the  sample  mean  X  =  —  E  X.  is  an  estimator  of  the  unknown 

n  n  .  .  i 

parameter  p  and 

Xn  ~  N(p,o//n),  i.e. 


Hence 


Now 


X  —  p 

— -  ~  N  (0, 1)  . 

o//n 


-1.96  0  1.96 


(Note  that  z  q2,.  =  1.960  from  Example  4.2.) 


-1.96  <  — -  <  1.96, 

a//n 


-1.96  —  <  X„  -  p  <  1.96  —  , 
‘  n  " 
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1.96  —  <  -y  <  -X  +  1.96  —  , 
/n  '  n  /E 


1.96  —  <  y  <  X  +  1.96  —  , 
/n  "  /n 


.*.  Pr  X  -  1.96  —  <  y  <  X  +  1.96  —  =  .95 
r  n  /S  ‘  "  n  /n 


It  shows  that,  with  the  probability  .95,  the  population 
mean  y  is  between  X  -  1.96  —  and  X  +  1.96  —  . 

n  /H  n 


Suppose  that  a=5,  then 


Pr{Xn  -  1.4  <  u  <  Xn  +  1.4}  =  .95. 


The  interval  (X  -1.4,  X  +1.4)  is  a  95%  confidence  interval  for  y. 
n  n 

If  we  have  taken  a  random  sample  X^,  X 2,  ...,  Xfi  and  obtained 
Xn=7.5,  for  n=49,  then 


Xn-1.4  =  7. 5-1. 4  =  6.1, 


X  +1.4  =  7. 5+1. 4  =  8.9, 
n 


(6. 1,8. 9)  is  one  estimate  of  95%  confidence  interval  for  y. 

" (X^-l. 4 ,  Xn+1.4)  is  a  95%  confidence  interval  for  y"  means  that 

if  we  obtained  100  copies  of  X  ,95  copies  of  (X  -1.4,  X  +1.4) 

n  n  n 


contain  the  population  mean  y. 


X 

n 

(Xn-!.4, 

Xn+1*  4) 

7.5 

(6.1, 

8.9) 

7.2 

(5.8, 

8.6) 

8.1 

(6.7, 

• 

9.5) 

6!  3 

(4.9,* 

7.7) 

[xxxxxxxxx] 

[x  X  X  X  X  X  XXX  x] 

[x  x  ;x  x  x  x  x  xx] 

[xxxxxxxxt] 
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Definition  8.1.  Let  X^,  X2,  . ..,  X^  be  a  random  sample  of  size  n 
from  a  population  distribution  f(x;0),  where  0  is  unknown  parameter. 
f£ (X^» X2 , . . • , Xn> i  0 (X^, Xj # • • . » X  ) ]  is  called  a  (1-v)  100%  confi¬ 
dence  interval  for  0  if 

pr{i(xi'x2"  *  ’  'Xn}  -  9  -  6  (X1,X2, .  . .  ,Xn)  } 

=  1-v. 

We  note  that 

(1)  the  lower  limit  d^fX^/X^r .  •  •  ,X^)  and  the  upper  limit 
0 (X^/Xj# . . . »Xn)  are  both  functions  of  the  random  sample  X^,X2, 

... ,xn. 

(2)  In  Example  8.1, 

0 (X. , X- , . . . , X  )  =  x  -  1.96  —  , 

-  1  2  n  n  ^ 

F (X. , X_ , . . . , X  )  =  X  +  1.96  —  . 

1  ^  n  n  /n 

(3)  Possible  values  for  "v"  are,  usually 

v  1-v  (1-v)  100% 

.01  .99  99% 

.02  .98  98% 

.05  .95  95% 

.10  .90  90% 

Theorem  8.1.  If  X  -  N(y,a)  with  a  known,  then  (1-v)  100%  confi¬ 
dence  interval  for  y  is 
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a 


[x  -  Z  —  ,  X  +  z  — 1. 

Ln  ^  n  j/hJ 


Example  8.2.  Suppose  that  X  ~  N(u»o=5)  and  Xn  =  70  computed  from 
25  samples.  To  find  90%  confidence  interval  for  y; 


v  =  .10  ,  -£  =  .05,  Z  =  Z  .  =  1.645 

/.  v  .  Uo 


U  =  X  -Z  — =70-  1.645  x  —  =  68.355 
n  \  /H  /25 


U  =  X  +  Z  — =70+  1.645  x  -5-  =  71.645 
n  \  Vn  /25 


(68.355,  71.645)  is  a  90%  confidence  interval  for  yi. 


(B)  For  Weibull  Distributions 

Suppose  that  X  ~  W(6,a)  with  a  known  and  B  unknown  parame¬ 
ters.  It  has  been  shown  that 


a  f i  n  aT 
$  =  -  l  XT 

n  |_n  i=i 


is  M.L.E.  of  8  and 


(*)  2n&Vs  x2(2n). 


0  *1-V2 
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Theorem  8.2  (Two-sided) 


is  (1-v)  100%  confidence  interval  for  B. 


\a 

Proof:  (2n)  <  2n(^- 

1-2 

i  1  2 

-J  <  (2 n)  =  1-v  from  (*)  and  sim- 

2 

plify  the  equation. 

Theorem  8.3.  (one-sided) 

Proof:  prj2n(jf/  <  *^(2n)j  =  1-v  and  simplify  the  equation. 

Example  8.3.  Continuation  of  Example  7.1. 

From  Data  D,  =  213.86,  a  =  6,  and  n  =  8.  95%  confidence 

interval  for  6  is. 


(a)  Two-sided; 


1-v  =  .95,  v  =  .05,  ~  =  .025,  1-J  =  .975 
2n  =  16 

*?025(16)  =  28*8'  X?975  (16)  =  6*91 

(from  Table  II) 

1/aA  /  lfi  \V6 

Br,  '  (iSTsJ  213  •  86  -  193 ' 91 


245.98 


.'.  (193.91,  245.98)  is  95%  confidence  interval  (two- 

sided)  for  B. 

(b)  One-sided; 

1-v  =  .95,  v  =  .05,  2n  =  16 
x2q5(16)  =  26.3  (from  Table  II) 

1/aA  /  16  \1/6 

Bn  =  213‘86  =  196‘86 


.’.  (196.86,  °°)  is  95%  confidence  interval  (one-sided) 

for  8. 

(C)  Applications  in  Probabilistic  Design 

Suppose  that  the  breaking  strength  (or  fatigue  life)  X  of 
a  composite  material  has  a  Weibull  W(8,a)  distribution  with  a 
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value  known.  On  the  basis  of  test  data  x.  ,  x_,  . ..,  x  from  the 

■i-  “  n 

population  X,  the  unknown  parameter  3  is  estimated  by  the  M.L.E. 
a  Ti  n  al1/01 

en  =  n  .^i  xi  ‘  The  lower  of  one-sided  95%  confidence 


interval  for  8  is  computed  as  6  =  f  ■  - 

‘  V?05‘2"> 


)W\ 


so  that 


pr  (B  >  8  }  =  .95. 


Note  that  6L  is  a  very  "conservative"  estimate  of  8  and  can 
be  used  for  computing  design  strength  (or  life).  if  x  ~  W(£T/a), 
then  the  design  strength  (or  life)  x  can  be  related  to  the  reli¬ 
ability  as 

Reliability  R  =  P  {X  >  x)  =  exp  -  /—  \ 

\K) 

A  (  1  \  a 

if  and  only  if  x  =  8^ ^£n  .  The  following  design  values  are 

defined  for  the  reliability  R  =  .99  and  R  =  .90. 


Definition  8.2. 


xA  (A-allowable) 


x0  (B-allowable) 


(en .  99) 

i1"  Tot)" 


Example  8.4.  Continuation  of  Example  8.3. 

(b)  8l  =  196.86  and  a  =  6. 


A-allowable;  xA  =  196.86  x 


B-allowable;  xfi  =  196.86  x 


(*»  75?)1 


=  91.45 


135.29  . 


9. 


TESTING  HYPOTHESES 


It  is  often  the  purpose  of  a  statistical  inference  to  answer 
a  "yes"  or  "no"  question  about  some  characteristics  of  a  popula¬ 
tion. 

Suppose  that  a  measurement  random  variable  X  has  a  p.d.f. 
f(x;8)  with  unknown  0.  On  the  basis  of  a  random  sample  x^,X2, 
...,xn  from  f  (x; 0 ) ,  it  is  desired  to  decide  whether  a  hypothesized 
proposition  on  8  is  true  or  false.  Suppose  that  two  contradictory 
states  of  nature  are  represented  by  two  hypotheses, 

null  hypothesis  ;  0  =  0^ 
alternative  hypothesis  H^ ;  0  =  0^ 

and  a  decision  should  be  made  to  reject  H^  (accept  H^)  or  to 
accept  Hq  (reject  H^) . 

Two  possible  errors  exist  in  this  decision  making  process; 

Type  I;  reject  Hq  when  HQ  is  true 
Type  II;  accept  HQ  when  is  true  . 

The  null  hypothesis  Hq  should  be  the  one  which  corresponds 
to  the  state  of  nature  that  one  would  not  eliminate  unless  there 
is  a  strong  evidence  to  do  so.  It  follows  then  that  the  Type  I 
error,  "reject  H^  when  H^  is  true"  is  the  more  serious  error. 

The  testing  procedure  is  then  in  such  a  way  that  the  prob¬ 
ability  of  Type  I  error  (the  worst  error)  should  be  less  than  a 
small  prescribed  probability,  say  0.1,  0.05,  0.02  or  0.01,  and 
the  probability  of  Type  II  error  should  be  minimized. 

The  level  of  significance  v  (v  =  .1,  0.05,  0.02,  or  0.01) 
is  defined  to  be  the  maximum  probability  of  the  Type  I  error,  i.e. 

Pr(Reject  HQ |HQ  true}  <  v. 

The  testing  of  the  hypotheses  is  then  to  find  a  region  for 
"rejecting  HQ",  which  is  called  a  "critical  region",  so  that 

Pr (Reject  Hq | Hq  true}  =  Pr(critical  region |Hq  true}  <  v. 


The  critical  region  can  be  expressed  in  terms  of  a  statistic 
6  =  6 (X1/X2, . . . ,Xn)  a  function  of  a  random  sample  Xj,X2,...,X 
from  f(x;0),  which  is  related  to  the  parameter  8  that  the  hypo¬ 
theses  are  stated  on. 

Example  9.1.  Suppose  that  a  composite  material  strength  X  has 
N  (y ,  10 )  distribution  and  it  is  desired  to  test, 

Hq;  u  =  90 

H^;  y  >  90  with  v  =  .05. 

On  the  basis  of  nine  samples  of  tests,  the  sample  mean 
Xn  =  93.2  is  obtained. 

Note  that  we  reject  HQ  (accept  if  and  only  if  we  have 
Xn  >  a  ^or  some  real  number  a. 

In  order  to  find  a  critical  region,  i.e.  to  find  the  value 

of  a, 


Pr (Reject  HQ|H0  true}  =  pr{Xn  >  a|u  =  90} 

X_-90  _  )  /  .  / 

(since  X  d  10 


=  P 


n 


a- 90 


10//9  10//9 


\ 


n 


according  to  Theorem  6.11 


=  P 


Z  > 


a-90 

10//9 


=  . 05  =  v,  i.e. 


Yq/2  =  (from  Table  I)  and 

a  =  90  +  1.645  x  ^  =  95.483, 

hence  we  have  a  critical  region; 

Reject  Hq  if  and  only  if  Xn  >  95.483. 
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Therefore  our  conclusion  on  the  basis  of  X  =  93.2  is  then  to 

n 

"accept  H  " . 


Example  9.2.  Suppose  that  X  ~  W(8;a=10)  and  it  is  desired  to 
test 


Hq ;  6  =  100 

H1;  6  >  100  '  with  v  =  0.05. 


On  the  basis  of  nine  data  points,  we  obtained 


6  -  T1  £  Xal1/a  -  l1  ?  x10 

89  ‘  LE  i=i  XiJ  ■  b  i=i  *  J 


1/10 


=  106.3 


/6  \a 

Under  HQ ,  2n Upj  =2x9 


(A  x 

A_V 

100/ 


'  X2(18)  s°  Pr  18(lfoj  2  /os'181 


=  .05,  i.e.  x2nl-(18)  =  28.87  (from  Table  II)  and  P  {£  >  104.84} 

• uj  r  9  - 

=  .05.  Hence,  reject  HQ  if  and  only  if  £  >  104.84.  Therefore, 

our  decision  is  to  "reject  H0". 

Chi-Square  Goodness  of  Fit  Tests 

On  the  basis  of  a  random  sample  x^Xj,  .  .  .  ,x  ,  it  is  often 
desired  to  know  "what  (family)  is  the  population  distribution 
f(x;0)".  Is  it  Weibull,  Normal  or  something  else? 

To  test  hypotheses, 

Hq,*  f(x;0)  is  Weibull  (Normal,  etc.) 

H^;  f  (x ; 0 )  is  not  Weibull 


with  the  level  of  significance  v,  we  use  the  Chi-square  goodness 
of  fit  test; 
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Classes 


a0‘ai 

ara2 


Observed 

Frequency 


Expected 

Frequency 


a  ,  -a 
m-1  m 


m 


m 


n  =  sample  size 
m  =  number  of  classes 
f.  =  number  of  samples  between  a.  ,  and  a. 

i- 1  i 

ei  =  n  x  where  p^  =  Pr{a^_^  <  X  <  a^}  and  the  random  variable 
X  has  a  distribution  given  under  Hq. 

2 


.  .  2  m 

The  statistic  \  —  Z  - — -  measures  the  difference 


i=l  ei 


between  observed  and  expected  frequencies  and  under  null  hypothe¬ 


sis  Hq/ 


X2  ~  X2 (m-r) , 


where  r  =  1  +  {number  of  parameters  estimated  to  compute 


the  expected  frequency  e.^ } . 
Now  reject  HQ  if  x2  >  *2(m-r) 
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H^:  Data  B  came  from  a  Weibull  distribution  v  =  .05 
n  =  36,  a  =  13  and  g  =  213 


classes 

frequency  f ^ 

Pi 

expected  frequency 

e .  =  n  x  p . 
i 

less  than 

159.5 

2 

.023 

.  83 

159.5  - 

169.5 

0 

.  027 

.97 

169.5  - 

179.5 

0 

.  053 

1.89 

179.5  - 

189.5 

2 

.  094 

3.38 

189.5  - 

199.5 

5 

.151 

5.  44 

199.5  - 

209.5 

9 

.206 

7.42 

209.5  - 

219.5 

6 

.  219 

7.85 

219.5  - 

229.5 

8 

.  157 

5.  64 

229.5  - 

239.5 

3 

.  061 

2.21 

more  than 

239.5 

1 

.  010 

.  37 

m  =  10,  r  =  3 


•> 

x' 


10 

I 

i=l 


4.695 


x205(7)  =  14.1  (from  Table  II). 

Reject  Hq  if  x2  >  14.1. 

Our  decision  is  to  accept  HQ. 
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10.  SUMMARY 


(a)  A  two  parameter  Weibull  distribution  W($,a)  is  used  for 
representing  the  static  strength  or  fatigue  life  dis¬ 
tributions  of  composite  materials. 

(b)  Hahn-Kim  and  Yang's  residual  strength  degradation  model 
was  presented  as  a  fatigue  model  of  composite  materials. 

(c)  A  graphical  plotting  and  a  maximum  likelihood  estimation 
method  was  introduced  for  estimating  the  parameters  of 
the  Weibull  distributions.  The  95%  lower  confidence 
bound  of  B  was  obtained  for  design  purposes  (A-allow- 
able  and  B-allowable) . 

(d)  The  problem  of  testing  hypotheses  on  the  Weibull  scale 

2 

parameter  B  was  solved  (x  -test)  when  the  shape  param¬ 
eter  a  is  known. 

(e)  Some  additional  results  on  the  estimations  and  testing 
hypotheses  involving  the  Weibull  distributions  are 
needed  to  be  investigated  and  summarized  for  the 
applications  in  composite  materials  engineering. 
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